SOME OPEN PROBLEMS IN RANDOM MATRIX THEORY AND 
THE THEORY OF INTEGRABLE SYSTEMS 



PERCY DEIFT 



Abstract. We describe a list of open problems in random matrix theory and 
integrable systems which was presented at the conference "Integrable Systems, 
Random Matrices, and Applications" at the Courant Institute in May 2006. 



Introduction 

During the conference "Integrable Systems, Random Matrices, and Applica- 
tions," held at the Courant Institute in May 2006, the organizers asked me to 
present a list of unsolved problems. What follows is, more or less, the list of prob- 
lems I presented, written down in an informal style. Detailed references are readily 
available on the web. In the text, various authors are mentioned by name: this is 
to aid the reader in researching the problem at hand, and is not meant in any way 
to be a detailed, historical account of the development of the field. I ask the reader 
for his'r indulgence on this score. In addition, the list of problems is not meant to 
be complete or definitive: it is simply an (unordered) collection of problems that 
I think are important and interesting, and which I would very much like to see 
solved. 

Problem 1. KdV with almost periodic initial data. Consider the Korteweg- 
de Vries (KdV) equation 

(1) ut + uu^ + u.^^.^ ^ 
with initial data 

(2) u{x,t ^0) = uoix), xeR. 

In the 1970s, McKean and Trubowitz proved the remarkable result that if the initial 
data uq is periodic, uo{x + p) = uo{x) for some p > 0, then the solution u{x,t) of 
([1]) is almost periodic in time. The conjecture is that the same result is true if uo{x) 
is almost periodic. This turns out to be an extremely difficult problem: for such 
initial data, even short time existence for ([T]) is not known. Loosely speaking, the 
difficulty centers around the fact that, in the periodic case, various quantities 
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are conserved, e.g. 

rP rP 

/ u^{x,t)dx= / ul{x)dx, t>0, 
Jq Jo 

but in the almost periodic case, we only know that averaged quantities, such as 
linii^oo J^i^ u^{x,t)dx, are conserved. It is not clear how to use such averaged 
quantities to analyze the Cauchy problem for Furthermore, the result of 
McKean and Trubowitz relies in an essential way on the integrability of KdV: in 
particular, H{t) — —-^ + u{x,t) undergoes an isospectral deformation under (fl]). 
and the periodic spectrum of H{t) provides integrals for the flow. In the almost 
periodic case, assuming that a solution u{x, t) exists for t > 0, the same is true for 
spec{H{t)), which equals spec{H{t = 0)); the problem is that very little is known 
about spec{H{t — 0)) for general, almost periodic uq{x). The spectrum may have 
absolute continuous, singular continuous, and pure point components. For example, 
in famous work from the 1970s, Dinaburg and Sinai used KAM techniques to show 
that H ~ — ^^+cosa; + cos(-\/2a;) has (some) absolutely continuous spectrum. And 
in the 1980s Moser constructed an example of a limit periodic potential uo{x) for 
which _ff = — + uq (x) has Cantor spectrum. And even if the spectral theory of 

~lx^ +uo(a^) for general almost periodic uo{x) was known, that would only be the 
first step in understanding the short time, and then the long time, behavior of the 
solution of ID). 

Problem 2. Universality for Random Matrix Theory (RMT). Univer- 
sality for unitary matrix ensembles (/3 = 2) in the bulk and at the hard and soft 
edges is now rather well understood. For orthogonal and symplectic ensembles 
{/3 — 1 and /3 = 4 respectively) universality in the bulk and at the hard and soft 
edges is now understood in the case that the underlying distribution is of the form 

1 g-trQ(A/) ^j^jjj.^, 

(3) Q(x) = aox" + aia;"~i + • • • 

is a polynomial. It is of considerable interest to prove universality in the cases 
/? = 1 and (3 — 4 ioi more general potentials Q{x), in particular for Q{x) of the 
form 

(4) Q{x)^NV{x), 

where V{x), is, say, a polynomial. In case ([3]), the associated equilibrium measure is 
supported on a single interval, whereas in case ^ the equilibrium measure may be 
supported on a union of disjoint intervals, and this makes the problem considerably 
harder. Following Tracy- Widom and Widom, one still utilizes known asymptotics 
for polynomials orthogonal with respect to the weight e^^^^^'' dx on R, but now 
one works on a Riemann surface of genus 5 > as opposed to the Riemann sphere 
in case ([3|). The real difficulty involves a certain determinant. Dm say, that arises 
in the analysis. From the algebraic point of view, Dm can be viewed as controlling 
the change of basis from orthogonal polynomials to skew orthogonal polynomials, 
as the size N of the polynomials goes to infinity. From an analytical/physical point 
of view. Dm arises in the computation of a ratio of partition functions, again in 
the large N limit. The essential task here is to show that Dm 7^ 0. In case ([3]) 
this follows from a lengthy ad hoc calculation; in the case the problem is wide 
open, though recent work of Shcherbina may point the way. Alternatively, Lubinsky 
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has recently developed very interesting new methods for unitary ensembles. Can 
these methods be adapted to prove universality for the orthogonal and symplectic 
ensembles? 

For Wigner ensembles, universality at the edge (Soshnikov,...) is now well un- 
derstood for a wide variety of distributions on the entries of the matrices. It is a 
long-standing conjecture, now more than forty years old, that universality is also 
true in the bulk, but there has been very little progress. At the edge, moment 
methods are powerful enough to prove universality, but in the bulk they provide no 
information. By contrast, for invariant ensembles of random matrices, one is able 
to prove universality in the bulk because of the availability of explicit formulae for 
eigenvalue statistics that are amenable to asymptotic analysis. 

A priori, universality is more plausible for unitary, orthogonal and symplectic 
ensembles because of the invariance properties that are built in: such ensembles 
are already "part of the way there." This is not the case, however, for Wigner 
ensembles. Universality in the bulk for Wigner matrices is a conjecture par excel- 
lence that digs deep into the structure of random matrices. Numerical experiments 
provide convincing evidence that it is true. 

Problem 3. Riemann-Hilbert Problem with non-analytic data. In many 
situations one is concerned with the asymptotic behavior of Riemann-Hilbert prob- 
lems with exponentially varying data of the form e™'^^^V(z), n oo. The Dcift- 
Zhou nonlinear steepest descent method for such problems requires ^(2;) to be 
analytic. The analyticity is used in two ways: to control the equilibrium measure 
associated with the problem, and tlicni to deform the contour for the RHP. By con- 
trast, the method only requires minimal smoothness for r{z) (see eg. Deift-Zhou 
in the context of Problem 12 below). It is of considerable theoretical and practical 
interest to extend the nonlinear steepc;st descent method to situations where (p is no 
longer analytic, and has, for example, only a finite number of derivatives. For very 
interesting work on the analyticity problem, we refer the reader to a recent paper 
of Miller and McLaughlin. There is also interesting, older work due to Varzugin. 

Problem 4. Painleve equations. What I have in mind here is not a specific 
problem, but a project, a very large scale project. The six (nonlinear) Painleve 

equations form the core of "modern special function theory." The role that the clas- 
sical special functions, such as the Airy, Bessel, and Legendre functions, started to 
play in the 19*^ century, has now been greatly expanded by the Painleve functions. 
Increasingly, as nonlinear science develops, people are finding that the solutions 
to an extraordinarily broad array of scientific problems, from neutron scattering 
theory, to PDEs, to transportation problems, to combinatorics,..., can be expressed 
in terms of Painleve functions. What is needed is a project, similar to the Bate- 
man project, or a new volume of Abramowitz and Stegun, devoted to the Painleve 
equations. Much can be, and has been, proved regarding the algebraic and as- 
ymptotic properties of Painleve functions. Here the role of integral representations 
and the classical steepest descent method in deriving precise asymptotics and con- 
nection formulae for the classical special functions is played, and expanded, by a 
Riemann-Hilbert representation of the Painleve equations, together with the non- 
commutative steepest descent method introduced in 1993. Very little is known, 
however, beyond ad hoc calculations, about the numerical solution of the Painleve 
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equations. If u(x) is the solution of the Painleve II equation, say, which is as- 
ymptotic to the Airy function Ai{x) as a; — » +00, one would like to know, for 
example, the location of its poles in the complex x-plane. A modern "Bateman 
Project: Painleve equations" would not/should not provide tables for such solu- 
tions. Rather, it should provide reliable, easy to use software to compute the 
solutions. Writing useful software for such nonlinear equations presents many chal- 
lenges, conceptual, philosophical and technical. Without the help of linearity, it is 
not at all clear how to select a broad enough class of "representative problems." 
The software should be in the form of a living document where new numerical 
problems can be addressed by a pool of experts as they arise. And at the technical 
level, how does one combine asymptotic information about the solutions obtained 
from the Riemann-Hilbert problem, together with efficient numerical codes in order 
to compute the solution u{x) at finite values of x? 

I believe that the importance of the Painleve Project will only grow with time. 
It should be viewed as creating a national resource and should probably be funded 
and led at the national level. The NIST Project "Digital Library of Mathematical 
Functions", where Peter Clarkson has a contribution on Painleve functions, is an 
encouraging first step. 

Problem 5. Multivariate analysis. Random matrix theory was introduced 

into theoretical physics by Wigncr in the 1950s in his study of neutron scattering 
resonances, but as a subject, RMT goes back to the work of statisticians at the 
beginning of the 20th century. Recently, advances in RMT have opened the way 
to the statistical analysis of data sets in cases where the number of variables is 
comparable to the number of samples, and both are large. One might, for example, 
be interested in the daily temperature in hundreds of cities around the world, over 
a 365 day time period. At the technical level, one considers the statistics of the 
singular values of (appropriately centered and scaled) p x n matrices M = (Mij), 
where p ~ n ^ 00. Hero p is the number of variables and n is the sample size. 
More precisely, one centers the Mij's around their sample averages, 

1 " 

Mij Mij = Mij -- ^Mik, 

' k=l 

and considers the eigenvalues h > ■ ■ ■ > Ip > and associated eigenvectors 
wi, . . . ,Wp of the p X p sample matrix S = ^MM^ . The k^s and ifj's are known 
as the principal component eigenvalues and eigenvectors, respectively. In Principal 
Component Analysis (PCA) "significant" dimension reduction in the data occurs if 
the first few principal components hjh, ••• account for a "high" proportion of the 
total variance ti S = Y^^=i h ■ 

A common model for the variables My is to assume that they follow a (real) p- 
variate Gaussian distribution Np{i2, S) with mean /x and covariance matrix S. Thus 
the columns {M\j , M2j , ■ ■ ■ , MpjY' provide n independent samples for Np{^,T,). 
Using recent results from RMT, much has now been proved about the statistics of 
h, I2, ... as j3, n ^ 00, p/n ^ J € (0, 00), in the case E = /. In particular, we know 
that in the limit, h, appropriately centered and scaled, satisfies the Tracy- Widom 
distribution for the largest eigenvalues of a GOE matrix. However, most interest- 
ing applications involve so-called spiked populations, a terminology introduced by 
Johnstone, i.e. situations where most of the eigenvalues r]i, ...,r]p of S are equal to 
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a common value, say 1, but the first few eigenvalues are greater than 1. Thus 

771 > 772 > • • • > ??/c > r]k+i = ••• = % = ! 

for some fixed k « p. It is a major problem in multivariate analysis to analyze 
the statistics of the eigenvalues li,l2, ■■■ as p, n — > 00, p/n (0,oo) for such 

spiked populations. There are very interesting phase transitions in the theory. For 
example if r]i > 1 + .^, then, asp ~ n — > 00, Zi emerges from the Marchenko-Pastur 
continuum ((1/7 ~ 1)^; (^/7 + 1)^)) where most of the Ij's tend to accumulate, and 
almost surely 

In the spiked, complex case, i.e. when the columns {Mij,M2j. ■■■,Mpj)'^ are sam- 
pled from the complex p-variate Gaussian distribution, much is known about the 
asymptotic distribution of the Z^'s, as p, n — > 00, p/n — !■ 7 G (0, 00). By contrast in 
the real case, apart from a.s. convergence of the Z^'s, very little is known about their 
asymptotic distributions. In the spiked, complex case the analysis is enabled by 
a particular technical tool, the Harish-Chandra-Itzykson-Zuber formula: unfortu- 
nately, no analog of this formula is known in the real case. While there are relatively 
few applications of complex spiked populations, knowledge of the asymptotic dis- 
tributions of the /i's for real spiked populations would have immediate applications 
to a wide variety of problems in signal processing, genetics and finance. 

Problem 6. /3-ensembles. Random point processes corresponding to /3- 
ensembles, or, equivalently, log gases at inverse temperature (3, are defined for 
arbitrary /3 > 0. The orthogonal, unitary, and symplectic ensembles correspond- 
ing to /3 = 1, 2, or 4, respectively, are now, of course, well understood, but other 
values of (3 are also believed to be relevant in applications, for example, in the sta- 
tistical description of headway in freeway traffic. For certain rational values of /3, 
/3-ensembles are related to Jack polynomials, but for general much less is known. 
The analysis of /3-ensembles for general /3 represents an interesting, and increasingly 
important, challenge. 

Recently there have been significant developnic;nts in the theory of general /3- 
ensembles. As a result of the work of Edelman and Dumitriu, and also others, 
we now know that for all /3 > 0, there exist (tridiagonal) random matrix models 
whose eigenvalues are distributed according to /3-enscmbles. Furthermore, taking 
an appropriate scaling limit of these tridiagonal matrix ensembles, one arrives at 
the following remarkable fact. Let B{x), x > 0, denote Brownian motion, and 
for any /3 > 0, let H/^ denote the Schrodinger operator ^ — x — -^dB{x) acting 

on _L^((0, 00), with Dirichlet boimdary conditions at x = 0. Then (Edelman- 
Sutton, Ramirez-Rider- Virag) for almost all realizations B{x), a; > 0, Hjj is self- 
adjoint with discrete spectrum Xi{B,/3) > X2{B,/3) > ■ ■ ■ and for each k, \k{B,(3) 
has precisely the same distribution as the fc**^ largest eigenvalues of the correspond- 
ing /3-ensemble in the standard edge scaling limit. Part of the challenge in analyzing 
general /3-ensembles is to use to obtain information about these ensembles. Al- 
ready the variational characterization of the Afe's has been used to give simple proofs 
of bounds on the asymptotics of the distributions of the A^'s. Question: can one 
derive the Tracy- Widom formula for Ai(B; (3 = 2), say, directly from Hp! 
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At a more conceptual level, the (/3-ensemble H^) correspondence brings ran- 
dom matrix theory front and center into the arena and practice of modern day 
probability theory. 

Problem 7. Non-self adjoint spectral problems. Much is now known, the- 
oretically and also numerically, about the spectrum of self-adjoint operators. This 
is in great contrast to the situation regarding non-self-adjoint operators, where the 
spectral theory is far more subtle and numerical schemes must overcome signifi- 
cant, inherent instabilities. The heart of the difficulties and instabilities lies in the 
following fact: in the self-adjoint case A = A*, the resolvent {A — A)~^ is bounded 
by the distance of A to the spectrum of A, but in the non-self-adjoint case, this is 

To n1 

no longer true, as we see already in the 2x2 case, A= ^ ^ , n — > oo. In recent 

years, a number of authors (e.g. Trefethan, Davios, ...) have initiated a systematic 
approach to non-self-adjoint spectral problems, notions such as pseudospectrum 
have come into prominence, and other authors have conducted in depth studies of 
particular non-self-adjoint spectrum problems which arise in practice. For example, 
in analyzing the semi-classical limit of the focusing NLS equation, one must analyze 
the spectrum of the associated AKNS operator T{h), as Planck's constant h goes 
to zero. The operator is non-sclf-adjoint, and as h goes to zero more and more 
eigenvalues, corresponding to solitons, emerge in the complex plane. It is of critical 
importance to the analysis of the semi-classical limits for NLS to determine where 
in the plane, and at what rate, the eigenvalues accumulate. The difficulty in doing 
this, theoretically and numerically, is illustrated by the following fact: the spectrum 
of T{h) off the real axis is a discrete set, whereas the numerical range of T{h) is an 
open subset of the plane. Nevertheless, every point A lying in the numerical range 
of T{h) is an eigenvalue of T{h) to all orders in h, \\{T{h) - X)u\\ = 0{\h\'') for 
any A; > 1, for some u = u{h), \\u\\ = 1. In the language of Kruskal, the compu- 
tation of the spectrum of T{h) is a problem "beyond all orders." Much has been 
done (Kamvissis-Miller-McLaughlin, Tovbis-Venakides-Zhou,...) in analyzing the 
semi-classical limit for NLS in special situations where the spectral problem can be 
solved explicitly. The spectral problem with general data, both for NLS and also 
other related non-self-adjoint problems, however, is far from understood, and poses 
a great challenge whose resolution is still only in the initial stages. 

Problem 8. Long-time behavior with non-generic initial data. The 

long-time behavior of the solution of the Cauchy problem for a great many inte- 

grable systems on the line is now well-understood, using, for example, the Ricmann- 
Hilbert/steepest descent method. The method depends on full knowledge of the 
nature of the spectrum of the associated Lax operator. For systems such as KdV, 
MKdV, dcfocusing NLS and the Toda lattice, for example, one is able to describe 
the solution asymptotically in complete detail for general initial data. But for fo- 
cusing NLS, where the associated AKNS Lax operator T is non-self-adjoint (here 
we make contact with Problem 7), the situation is different. For generic T (i.e. an 
open dense set of T's in any reasonable topology) the spectrum consists of the real 
line, where the spectrum is absolutely continuous, together with a finite number 
of simple eigenvalues (corresponding to solitons) off the real axis. The analysis of 
the long-time behavior of focusing NLS with such generic initial data proceeds in 
a straightforward manner similar to KdV, MKdV, etc. For general initial data. 
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however, the situation is more comphcated. For example, let zq > be any pos- 
itive number and let D be any arbitrarily small open disk in the complex plane 
centered at zq, such that D <z {z : 3tz > 0}. Let denote the intersection 
of D with the upper half plane, and let u{z) be an arbitrary function analytic in 
-D"*", and continuous in D+ . Let B — {z ^ : u{z) — 0}. Then there exists 
(Zhou) an AKNS operator T with infinitely smooth, rapidly decaying coefficients 
with the property that each point in B is anL2(M)-ei genvaluc of T. In other words, 
there exist (non-generic) operators T with Schwartz space coefficients which have 
spectrum acciimulating on the real line at an essentially arbitrary rate. It is a 
very interesting question to determine what effect such singularities would have on 
the long-time behavior of the solution of NLS. In particular, recalling that focusing 
NLS provides a model for data transmission along communication cables, are the 
effects measurable? 

The difficulty that we encounter here is not limited to situations where the as- 
sociated Lax operator is non-self-adjoint. Even in situations when the associated 
operator is self-adjoint, but of order greater than two, similar difficulties can arise. 
This is true, in particular, for the Boussinesq equation, where the associated Lax 
operator is third order. The long-time behavior of the solutions of the Boussinesq 
equation with general initial data is a very interesting problem with many chal- 
lenges. Even in the case with generic initial data the situation is only partially 
understood. 



Problem 9. The parking problem. A number of so-called "transportation" 

problems have now been analyzed in terms of RMT. These include: the "vicious" 
walker problem of M. Fisher, the bus problem in Cuernavaca, Mexico, the headway 
traffic problem on highways, and the airline boarding problem of Bachmat et al. 
Recently, researchers in London, Prague, and also Ann Arbor, have noticed an 
intriguing phenomenon. They found that the fluctuations in the spacings between 
cars parked on a long street exhibited RMT behavior. Furthermore, Seba found 
that there was a difference whether the street is two-way or one-way (On a two-way 
street, the cars park only on the right, while on a one-way street one of course has 
the option of also parking on the left.) Quite remarkably, for two-way streets Seba 
found GUE statistics, but for left-side parking on one-way streets he found GOE 
statistics. It is a great challenge to develop a microscopic model for the parking 
problem, in analogy, perhaps, with the microscopic model introduced by Baik et al. 
to explain the RMT statistics for the bus problem in Cuernavaca. Seba's recent, 
intriguing calculations on the parking problem can be found posted on the web. 



Problem 10. A Tracy- Widom Central Limit Theorem. The fact that 
RMT, and the Tracy- Widom distributions, arise in so many problems in so many 
different areas leads one to the following question: how can one characterize RMT in 
purely probabihstic terms? For example, we know that if we take i.i.d.'s (ai, 02, ...), 
add them up, and then center and scale appropriately, 

{ai,a2, ■■■) {Si, S2, ■■■), Sn = — ^ , 

then as n ^ 00, Sn converges in distribution to a Gaussian random variable: this 
is the famous Central Limit Theorem. The analogous situation for RMT is the 
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following; take i.i.d.'s (ai,a2, ...), perform an operation X on them, 

(ai,a2, ...) {Xi,X2, ...), 

and as n ^ oo the X„'s converge to the Gaudin distribution, or the Tracy- Widom 
distribution. The question is, "What is X?" Important progress towards answer- 
ing this question has been made recently, and independently, by Baik-Suidan and 
Bodineau-Martin, but the full problem remains open and very challenging. 

Problem 11. The Toda lattice with random initial data. A fundamental 
question in numerical analysis is the following: how long does it take on average to 
diagonalize a random symmetric nx n matrix Ml There are different opinions about 
what is meant by random, and also many eigenvalue algorithms that one could 
choose. Also one must specify the accuracy that is required. For definitiveness, let 
us assume that M is chosen from GOE and that we use the standard QR eigenvalue 
algorithm. The fundamental question then takes the following more concrete form: 
given e > 0, how many QR steps does it take on average to compute the eigenvalues 
of a GOE random matrix to order e? In practice, one never computes the eigenvalues 
of a full nx n matrix M. Rather one first reduces M to a tridiagonal matrix 

"ai 6i 

J ^ bi 02 

'■• bn-l 
bn-1 an 

with the same spectrum as M using, say, a succession of Householder transforma- 
tions. Now it turns out, by pure serendipity, that the Householder transformation is 
eminently compatible with GOE and the statistics of ,/ can be computed explicitly: 
the a,'s and bj's are independent, the a^'s are i.i.d. Gaussians, and the b/s have x 
distributions, Xn-j, ^ < j < n — 1. Such matrices are an example of what we may 
call a TEl (triangular ensemble, /3 = 1). With these comments, the fundamental 
question now takes the following sharp form: given e > 0, how many QR steps does 
it take on average to compute the eigenvalues of a TEl matrix J to order e? 

In another, independent development in the 1970s, Flaschka, and also Manakov, 
showed that the Toda lattice, consisting of n 1-dimensional particles interacting 
with exponential forces 

(5) Xk = e'"'-^-^" - 6^"-^"+' , l<k<n, 

{xo = —00, Xn+1 = +oo) is a completely integrable Hamiltonian system with a Lax 
operator given by the tri-diagonal matrix 

"ai bi 

J = ^1 

bn-1 
bn-1 a-n 

where 

j ai = —hxi, 1 <i <n 

\ bi= ie2(^*-^i+i), 1 < i < n - 1. 
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With these variables, ([5]) takes the isospectral form 
(6) ^ = [S, J] = BJ- JB, 

where 

" bi 



bn-l 

-bn-i _ 

Subsequently Moser integrated ([5]) explicitly and showed that as t — > oo the particles 
become free, 

J Xi{t) — Xi{oo) + o(l), 1 < i < n 

\ Xi{t) — tXi{oo) + Xi{oo) + o(l), 1 < i < n, 

for some constants {0:^(00)}"^^ and {±^(00)}"^^, where i„(oo) > a;„_i(oo) > 
••• > xi{oo). Said differently, as t 00, J{t) converges to a diagonal matrix 

diag ^— . . . , — ^ . But as J{t) undergoes an iso-spectral evolution, it fol- 
lows that ai(oo) = _^^Aoo)_^ 1 < i < n, must be the eigenvalues of the original matrix 
J(t = 0), where ai(0) = 1 < i < n, bj{0) = iei(^^(o)'^j+i(o)), 1 <j< n-1. 

Deift-Li-Nanda-Tomei then raised the possibility of using the Toda lattice as an 
eigenvalue algorithm (the "Toda algorithm"): given a tridiagonal matrix Jq and 
e > 0, solve ([B]) with initial condition J(t = 0) = Jq, until maxi<i<„ bi(t) < e. Then 
(ii{t),l < i < are the eigenvalues of Jq to order e. Using earlier work of Symes, 
Deift et al showed that the QR algorithm itself was the time-Zc map, k = 1,2,..., 
of a completely integrable Hamiltonian system which Poisson commutes with the 
Toda Hamiltonian. In this picture, the choice of an eigenvalue algorithm becomes 
simply the choice of a Hamiltonian vector field. In view of the above comments 
and remarks, the fundamental question becomes: how long does it take on average 
for Toda particles with TEl initial data to become free (i.e. maxi<i<„_i bi < e)? 
Taking into account the ideas of deflation, it is enough to consider the time t at 
which just bn-i{t) < e. 

The solution of the fundamental question in the above form is a fascinating 
challenge and would clearly be of great interest to scientists and engineers alike. 

Problem 12. Perturbation theory for infinite dimensional integrable 
systems. The bijective mapping properties of the scattering transform for a variety 
of integrable systems on the line between appropriate weighted Sobolev spaces 
have now been established (X. Zhou) using Riemann-Hilbert techniques. This has 
made it possible, in particular, to analyze (Deift-Zhou) the long-time behavior 
of the solution of the Cauchy problem for a variety of integrable systems in a 
fixed space without a "loss of derivatives" . This in turn has made it possible to 
analyze perturbations of integrable systems. For example, Deift-Zhou analyzed the 
perturbed defocusing NLS equation 

(7) iut + Uxx-'2\u\'^u-eV{\u\)u = 

u{x, t = 0)^ uoix) e H''' = {/ e I /', xf G L^(R)} , 

where l^(|w|) ~ \u\p as \u\ — > 0, for some p > 2 sufficiently large. In addition to a 
full description of the long-time behavior of the solution of ([7|, a rather surprising 
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outcome of their calculations is that ([7]) remains completely integrable for all e 
(small and) positive. In the perturbation theory of the linear Schrodinger equation 
- when the term — 2|upw is absent from ([7]) - the key role in the analysis is played 
by the Fourier transform which diagonalizes the linear part of the equation. For the 
full equation ([71), the role of the Fourier transform is now played by the scattering 
transform which "diagonalizes" the (cubic) NLS part of the equation. In the case of 
the linear Schrodinger equation, the method relies on certain precise estimates that 
one obtains from the Fourier transform and the classical steepest descent method; in 
the fully non- linear case ([7]), the analogs of these estimates are obtained from the 
Riemann-Hilbert version of the scattering/inverse scattering transform, together 
with the steepest descent method for RH problems. One shows, in particular, that 
limt^oo(sup^gj^ |u(a;,t)|) = 0, which implies that as t — *■ oo the perturbation term 
eF(|u|)u ~ elwI^M, p > 2, becomes small with respect to the NLS term 2|upw. 

It is of great interest to consider perturbations of type ([7]) for the focusing (cubic) 
NLS equation, i.e. where the term — 2|upu is replaced by +2|upu. This changes the 
problem fundamentally as the focusing equation has soliton solutions. Such solu- 
tions do not decay uniformly in time (i.e. supj ^.gjj \u{x, t)\ > 0) and this complicates 
the analysis of the perturbed equation enormously, as the perturbation term is no 
longer small with respect to 2|upu (see above). Equation ^ in the focusing case 
is just one example of many and varied systems in PDF/Mathematical Physics. In 
higher dimensions, the behavior of such perturbed systems in the neighborhood of 
a simple non-linear bound state is well understood (Soffer, Weinstein,...), but for 
more than one bound state, very little is known. 

The solution of ([7]) in the focusing case in the neighborhood of a fc-soliton 
for (cubic) NLS {k > 2), together with a detailed description of the long-time 
asymptotics, would be regarded as a very significant development in the theory of 
PDEs/Mathematical Physics. In a certain sense, one expects ([7]) once again to be 
integrable for small e. Equation ([7]) is of course only one of many similar systems 
one could consider. For example, one could consider whether the Toda shock and 
Toda rarefaction phenomena persist when the exponential forces of interaction are 
perturbed: all numerical evidence indicates that this is so. Also, one could consider 
perturbations of the sine-Gordon equation as a model for the Fermi-Pasta-Ulam 
problem. 

There is a considerable body of work (Kuksin, Kapeller-Poschel) on Hamiltonian 
perturbations of integrable systems such as NLS or KdV in the spatially periodic 
case. Here KAM methods apply and the authors show that certain finite dimen- 
sional tori corresponding to finite gap solutions in the integrable case survive under 
perturbation. The periodic problem is more complicated than the problem on the 
line because of the action of dispersion. On the whole line dispersion forces soliton- 
free solutions u{x,t) to decay, s\xp^^g^u{x,t) ^ as t — > oo, and even for solutions 
with solitons (cf. comments above) we still expect decay away from the location 
of the solitons. In the periodic case, dispersion has no "room" to act and the 
perturbation term £V^(|u|)u is of the same order as for all time. 

Problem 13. Perturbation theory for exactly solvable combinatorial 
problems. The asymptotic behavior of a variety of combinatorial problems has 
now been analyzed in great detail. Here we have in mind Ulam's problem for 
the length of the longest increasing subsequence, the tiling problem for the Aztec 
diamond, the hexagon tiling problem, and the last passage percolation problem, 
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amongst many others. In all cases, in an appropriate scaling limit, the statistical 
fluctuations in the systems at hand are described by random matrix theory. The 
asymptotic analysis, however, depends in a critical, and rigid way, on the under- 
lying probability measures for the systems. For example, the analysis of the last 
passage percolation problem requires the waiting time Wi at each site i to be either 
geometrically or exponentially distributed (Johansson): if the statistics of the wait- 
ing times is neither geometric nor exponential, the analysis fails completely. What 
happens if the geometric distribution, say, is slightly perturbed? The challenge 
here is to develop an eS'ective perturbation theory for such systems. One expects 
that the random matrix behavior of the fluctuations should persist. Related work 
on this problem has been done by Baik-Suidan and Bouchard-Martin (cf. Problem 
10). 

Problem 14. Initial/boundary value problems for integrable systems. 

In the late 1990s Fokas introduced a new, more flexible approach to inverse scatter- 
ing theory, and in recent years a number of researchers (Fokas, Its, Anne Boutet 
dc Monvcl, Shcpclsky,...) have applied Fokas' approach to the initial/boundary 
value problem for various integrable systems such as NLS. Much progress has been 
made, but there is a basic, and puzzling, obstacle to applying the method, viz. 
one needs to know certain dependent data in order to proc;ced. For example, the 
initial/boundary value problem for NLS is well-posed if one gives the initial data 
u{x,0) = Uo{x), X > 0, and the boundary data ^(0,^) = ui{t), t > 0. However, 
in implementing the method it turns out that one needs to know the dependent 
data Ux{0,t), i > 0, as such information appears explicitly in the solution formu- 
lae. Progress has been made in determining M,,.(0,t), t > 0, from Uo{x) and ui{t) 
intrinsically via the method, but the control one obtains on ^^^(O, t) is not sufficient 
in order to obtain the long-time behavior of the initial/boundary value problem. 
This is true even in simple cases, such as the following: suppose ^(a::, t) solves NLS 
in (x > 0,t > 0) with u{0,t) = sin(a>t), lo ^ 0, and u{x,0) — uq{x), where uq is 
smooth with compact support. How does u{x,t) behave as t oo? The solution 
of this problem for any such Uo{x) would be a very significant development in the 
theory, and would also be of considerable interest in science and engineering. 

There is a philosophical point at stake here. The evolution of NLS in a; > 
represents the interplay of forces which are "integrable" at some fundamental, 
algebraic level, and indeed, if no other forces are present, as in the full-line scattering 
or periodic cases, the equation can be integrated explicitly. When one considers the 
initial/boundary value problem, however, new physical forces come into play which 
describe the interaction of the particles on the boundary with the NLS particles in 
the interior. There is absolutely no a priori guarantee that the enlarged system, 
"NLS particles in x > 0" -|- "particles on the boundary", is integrable. It may 
be that the long-time behavior of the composite system can only be solved for 
"generic. Cantor-set" like data, as is familiar from KAM theory. In other words, an 
explicit description of the long-time behavior of the solution of the initial/boundary 
value problem for NLS for general initial data may not be possible. This is a very 
intriguing situation. 

Problem 15. Multi-matrix models and models with an external field. 

There has been considerable progress (Kuijlaars,...) in understanding basic statis- 
tics such as the correlation functions for the 2-matrix random matrix model, and 
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also matrix models with a source. The key element in these developments has been 
the successful extension by Kuijlaars et al of the Riemann-Hilbert/steepest descent 
method to 3 x 3 Riemann-Hilbert problems. So far only the simplest situations have 
been considered. In order to consider the generic situation, one must, in particular, 
extend the Riemann-Hilbert/steepest descent method to n x n Riemann-Hilbert 
problems. This is a challenging problem which would have important implications, 
not only for random matrix models, but also for problems in other areas, such 
as Pade-Hermite approximations and irrationality questions for distinguished real 
numbers, and multi-orthogonal polynomials. 

Problem 16. Poisson/Gaudin-Mehta transition. On the appropriate scale, 
the bulk eigenvalues of a random GOE matrix M exhibit Gaudin-Mehta statistics. 
As noted in Problem 2, the same is believed to be true for general Wigner matrices. 
On the other hand, if M = M'^ has i.i.d. entries and bandwidth W = 1 (i.e. M 
is tridiagonal), then, on the appropriate scale, the bulk eigenvalues of M exhibit 
Poisson statistics. As the bandwidth W increases from 1 to A/' — 1 = dim(M) — 1, 
the eigenvalue statistics must change from Poisson to Gaudin-Mehta. A back of the 
envelope calculation suggests that there should be a (sharp) transition in a narrow 
region around W « \/]V. This is a well-known, outstanding, open conjecture with 
many implications for theoretical physics, particularly wave propagation in random 
media. There is a substantial body of work on this problem, which can be easily 
traced on the web. 

There are many other areas, closely related to the problems in the above list, 
where much progress has been made in recent years, and where much remains to be 
done. These include: totally asymmetric simple exclusion processes (TASEP) fol- 
lowing the work of Borodin, Sasamoto, Ferrari,..., asymmetric simple exclusion pro- 
cesses (ASEP) following the work of Tracy, Widom,..., orthogonal rational functions 
(Vartanian, Zhou, McLaughlin), partition functions for random matrix models and 
their connections to mappings on Riemann surfaces (Itzikson, Zuber,..., McLaugh- 
lin. Ercolani,..., Guionnet), the representation theory of "large" groups such as 
the infinite symmetric group or the infinite unitary group (Olshanski, Borodin, Ok- 
ounkov,...), and many others. But as I said in the beginning, my list is not complete 
or definitive. 
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